Goto

Collaborating Authors

 optimal hyperplane


Machine Learning: Progress and Prospects

arXiv.org Artificial Intelligence

This Inaugural Lecture was given at Royal Holloway University of London in 1996. It covers an introduction to machine learning and describes various theoretical advances and practical projects in the field. The Lecture here is presented in its original format, but a few remarks have been added in 2025 to reflect recent developments, and the list of references has been updated to enhance the convenience and accuracy for readers. When did machine learning start? Maybe a good starting point is 1949, when Claude Shannon proposed a learning algorithm for chess-playing programs. Or maybe we should go back to the 1930s when Ronald Fisher developed discriminant analysis - a type of learning where the problem is to construct a decision rule that separates two types of vectors. Or could it be the 18th century when David Hume discussed the idea of induction? Or the 14th century, when William of Ockham formulated the principle of "simplicity" known as "Ockham's razor" (Ockham, by the way, is a small village not far from Royal Holloway). Or it may be that, like almost everything else in Western civilisation and culture, the origin of these ideas lies in the Mediterranean. After all, it was Aristotle who said that "we learn some things only by doing things". The field of machine learning has been greatly influenced by other disciplines and the subject is in itself not a very homogeneous discipline, but includes separate, overlapping subfields. There are many parallel lines of research in ML: inductive learning, neural networks, clustering, and theories of learning. They are all part of the more general field of machine learning.


Kernel Methods for Implicit Surface Modeling

Neural Information Processing Systems

We describe methods for computing an implicit model of a hypersurface that is given only by a finite sampling. The methods work by mapping the sample points into a reproducing kernel Hilbert space and then deter- mining regions in terms of hyperplanes. Suppose we are given a finite sampling (in machine learning terms, training data) x1, . . . The case d 3 is especially interesting since these days there are many devices, e.g., laser range scanners, that allow the acquisition of point data from the boundary surfaces of solids. For further processing it is often necessary to transform this data into a continu- ous model. Today the most popular approach is to add connectivity information to the data by transforming them into a triangle mesh (see [4] for an example of such a transformation algorithm).


Hyperplane bounds for neural feature mappings

arXiv.org Artificial Intelligence

When minimising the empirical risk, the generalisation of the learnt function still depends on the performance on the training data, the Vapnik-Chervonenkis(VC)- dimension of the function and the number of training examples. Neural networks have a large number of parameters, which correlates with their VC-dimension that is typically large but not infinite, and typically a large number of training instances are needed to effectively train them. In this work, we explore how to optimize feature mappings using neural network with the intention to reduce the effective VC-dimension of the hyperplane found in the space generatedby the mapping. An interpretationofthe resultsofthis study isthat it ispossible to define a loss that controls the VC-dimension of the separating hyperplane. We evaluate this approach and observe that the performance when using this method improves when the size of the training set is small.


Support Vector Machines, Illustrated

#artificialintelligence

Support vector machines are a class of techniques in data science, which had great popularity in the data science community. They are mainly used in classification tasks and perform really well when few training data is available. Sadly, SVMs have been almost forgotten lately due to the massive popularity of deep learning. But I my opinion they are a tool that every data scientist should have in their toolbox, because they are faster to train and sometimes even outperform neural networks. In this blog, you will learn that SVMs use hyperplanes to separate and classify our data.


Support Vector Machines, Illustrated

#artificialintelligence

Support vector machines are a class of techniques in data science, which had great popularity in the data science community. They are mainly used in classification tasks and perform really well when few training data is available. Sadly, SVMs have been almost forgotten lately due to the massive popularity of deep learning. But I my opinion they are a tool that every data scientist should have in their toolbox, because they are faster to train and sometimes even outperform neural networks. In this blog, you will learn that SVMs use hyperplanes to separate and classify our data.


Support Vector Machines -- the basics

#artificialintelligence

The important job that SVM's perform is to find a decision boundary to classify our data. This decision boundary is also called the hyperplane. Lets start with an example to explain it. Visually, if you look at figure 1, you will see that it makes sense for purple line to be a better hyperplane than the black line. The black line will also do the job, but skates a little to close to one of the red points to make it a good decision line.


Support Vector Machines, Dual Formulation, Quadratic Programming & Sequential Minimal Optimization

#artificialintelligence

The Support-vector Machine (or called Support-vector Networks initially by the author -- Vladimir Vapnik) takes a completely different approach to solving statistical problems (in specific Classification). This algorithm has been heavily used in several classification problems like Image Classification, Bag-of-Words Classifier, OCR, Cancer prediction, and many more. SVM is basically a binary classifier, although it can be modified for multi-class classification as well as regression. Unlike logistic regression and other neural network models, SVMs try to maximize the separation between two classes of points. A brilliant idea is used by the author.


Behavioral analysis of support vector machine classifier with Gaussian kernel and imbalanced data

arXiv.org Machine Learning

The parameters of support vector machines (SVMs) such as the penalty parameter and the kernel parameters have a great impact on the classification accuracy and the complexity of the SVM model. Therefore, the model selection in SVM involves the tuning of these parameters. However, these parameters are usually tuned and used as a black box, without understanding the mathematical background or internal details. In this paper, the behavior of the SVM classification model is analyzed when these parameters take different values with balanced and imbalanced data. This analysis including visualization, mathematical and geometrical interpretations and illustrative numerical examples with the aim of providing the basics of the Gaussian and linear kernel functions with SVM. From this analysis, we proposed a novel search algorithm. In this algorithm, we search for the optimal SVM parameters into two one-dimensional spaces instead of searching into one two-dimensional space. This reduces the computational time significantly. Moreover, in our algorithm, from the analysis of the data, the range of kernel function can be expected. This also reduces the search space and hence reduces the required computational time. Different experiments were conducted to evaluate our search algorithm using different balanced and imbalanced datasets. The results demonstrated how the proposed strategy is fast and effective than other searching strategies.


Supporting the Math Behind Supporting Vector Machines!

#artificialintelligence

Support Vector Machine(SVM) is a powerful classifier that works with both linear and non-linear data. If you have a n-dimensional space, then the dimension of the hyperplane will be (n-1). The goal of SVM is to find an optimal hyperplane that best separates our data so that distance from the nearest points in space to itself is maximized. To keep it simple, consider a road, which separates the left, right-side cars, buildings, pedestrians and makes the widest lane as possible. And those cars, buildings, really close to the street are the support vectors.


CODEBUG

#artificialintelligence

Support vector machine (SVM) is a supervised machine learning algorithm which is considered effective tool for both classification and regression problem. In a simple word, SVM tries to find a linearly separable hyperplane in order to separate members of one class from another. If SVM can not find the hyperplane for a given data set, it applies non-linear mapping to the training data and transform them to higher dimension where it searches for the optimal hyperplane. The SVM algorithm uses support vectors and margins in order to draw these hyperplanes in the training data. Since it has ability to understand the complex relation in input data by applying nonlinear mapping, it has high accuracy compare to other supervised classification algorithms (kNN, NCC..) People have been using SVM for different applications like: text data classification, image data(handwritten) recognition and more.